November 11, 2016 // WSN, Monterey, California



data_final_v2.xls



Re: FWD: data question






Doing reproducible science

  • Better science in less time using data science tools

    • Lowndes et al., in prep
    • reproducibility is fundamental, but rarely tested
    • tools have changed how we do science
      • reproducibility, collaboration, communication

Data science

Conflated questions

Science:

  • what are X's habitat preferences?
  • when is Y most abundant?
  • how wide is their distribution?

Data science:

  • how do I import my data?
  • how do I subset the years I want?
  • how do I visualize this?
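
The three data science questions above can be sketched in a few lines of code. The talk recommends R; this minimal sketch uses Python's standard library purely for illustration. The column names (`year`, `abundance`), the sample values, and the function names are hypothetical and not taken from any real dataset.

```python
import csv
import io

# Hypothetical sample data standing in for a spreadsheet exported to CSV;
# the column names and values are illustrative assumptions.
RAW = """year,abundance
2010,12
2011,15
2012,9
2013,20
"""

def import_data(text):
    """Import: parse CSV text into a list of dicts with integer values."""
    reader = csv.DictReader(io.StringIO(text))
    return [{"year": int(r["year"]), "abundance": int(r["abundance"])} for r in reader]

def subset_years(rows, start, end):
    """Subset: keep only rows whose year falls within [start, end]."""
    return [r for r in rows if start <= r["year"] <= end]

def visualize(rows):
    """Visualize: build a crude text bar chart, one line per year."""
    return [f'{r["year"]} {"#" * r["abundance"]}' for r in rows]

rows = import_data(RAW)
recent = subset_years(rows, 2011, 2012)
chart = visualize(recent)
print("\n".join(chart))
```

Keeping each question as its own small function separates the data handling from the science question it serves.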

Data science tools

Ocean Health Index

a method to score benefits that oceans provide to people

Reproducible workflow

Reproducible and open workflow

You can do better science in less time:

Get to your science questions sooner

1. Learn to code
    - in R
    - with RStudio

2. Use version control
    - git
    - with GitHub
    - through RStudio

3. Learn in an intentional way

  • not in a panic, but feeling empowered
  • not for a single purpose, but thinking ahead
  • not in isolation, but with a community

Great resources

Thank you

to the OHI team, colleagues, #rstats community

Ocean Health Index

a method to categorize benefits that oceans provide to people

scores are modeled using existing data; data intensive

method can be tailored to different geographies
can help inform policy decisions, especially when repeated

Conflating questions

We could reproduce our science questions:

  • how does X change over time?
  • how does X change with Y?
  • how to model Z?


Data science:

  • how to subset just the years we want?
  • how to transpose columns to rows?
  • how to do this reproducibly?
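
As one illustration of these data science questions, the sketch below turns year columns into rows and then subsets the years of interest. It is written in Python's standard library rather than the R the talk recommends, and the table, the column names (`region`, `score`), and the helper names are hypothetical. Because every step lives in a script, rerunning it on the same input gives the same result.

```python
# Hypothetical wide table: one row per region, one column per year.
# Values are made up for illustration only.
wide = [
    {"region": "A", "2014": 71, "2015": 73},
    {"region": "B", "2014": 64, "2015": 66},
]

def wide_to_long(rows, id_col):
    """Transpose year columns to rows: one (id, year, score) record each."""
    long_rows = []
    for row in rows:
        for col, value in row.items():
            if col == id_col:
                continue  # the identifier column is carried along, not transposed
            long_rows.append({id_col: row[id_col], "year": int(col), "score": value})
    return long_rows

def subset_years(rows, years):
    """Keep only the records whose year is in the requested set."""
    return [r for r in rows if r["year"] in years]

long_rows = wide_to_long(wide, "region")
only_2015 = subset_years(long_rows, {2015})
print(only_2015)
```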

OHI Global Assessments

We expected to easily reproduce our previous work. We had planned ahead:

  • coded models
  • 130 pages of published supplemental material
  • internal documents and notes





…But we struggled to reproduce our work.